Are cluster validity measures (in)valid?

Abstract

Internal cluster validity measures (such as the Calinski-Harabasz, Dunn, or Davies-Bouldin indices) are frequently used for selecting the appropriate number of partitions a dataset should be split into. In this paper we consider what happens if we treat such indices as objective functions in unsupervised learning activities. Is the optimal grouping with regard to, say, the Silhouette index really meaningful? It turns out that many cluster (in)validity indices promote clusterings that match expert knowledge quite poorly. We also introduce a new, well-performing variant of the Dunn index that is built upon OWA operators and the near-neighbour graph, so that subspaces of higher density, regardless of their shapes, can be separated from each other better.
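
As a rough illustration of what treating a validity index as an objective means, the sketch below computes the classic Dunn index (smallest between-cluster distance divided by largest cluster diameter) for two candidate partitions of the same points. This is the textbook definition, not the OWA-based variant the abstract announces; the function name `dunn_index` and the toy data are assumptions made for this example only.

```python
from itertools import combinations
from math import dist


def dunn_index(points, labels):
    """Classic Dunn index (illustrative only, not the paper's OWA variant).

    Ratio of the smallest distance between points in different clusters
    to the largest distance between points in the same cluster.
    """
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("the Dunn index needs at least two clusters")

    # Largest within-cluster pairwise distance (cluster diameter).
    max_diameter = max(
        (dist(p, q)
         for members in clusters.values()
         for p, q in combinations(members, 2)),
        default=0.0,
    )
    # Smallest distance between points that belong to different clusters.
    min_separation = min(
        dist(p, q)
        for a, b in combinations(clusters.values(), 2)
        for p in a
        for q in b
    )
    if max_diameter == 0.0:
        return float("inf")  # every cluster is a single point
    return min_separation / max_diameter


# Toy data (hypothetical): two tight pairs of points, far apart.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 1.0)]
good = dunn_index(points, [0, 0, 1, 1])  # groups the nearby points together
bad = dunn_index(points, [0, 1, 0, 1])   # splits each nearby pair
```

On this toy set the index ranks the intuitive partition higher. The abstract's point is that, on realistic benchmark data, the partition that maximises such an index often disagrees with expert labels.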

Similar resources

Cluster Validity Measures Dynamic Clustering Algorithms

Cluster analysis finds its place in many applications especially in data analysis, image processing, pattern recognition, market research by grouping customers based on purchasing pattern, classifying documents on web for information discovery, outlier detection applications and act as a tool to gain insight into the distribution of data to observe characteristics of each cluster. This ensures ...

Are validated outcome measures used in distal radial fractures truly valid?

OBJECTIVES Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the methodological quality o...

Are common measures of dietary restraint and disinhibited eating reliable and valid in obese persons?

Disordered eating measures were developed and validated in primarily normal weight samples; thus, it is unclear if the psychometric properties are equivalent across weight groups. This study evaluated the reliability and validity of self-reported disinhibited eating and dietary restraint measures in a community-recruited sample of overweight individuals (N = 201) and obese individuals (N = 101)...

Is the Notion of Validity Valid in HCI Practice?

Much attention has been paid in the recent literature to the notions of validity, thoroughness, and effectiveness of different Usability Evaluation Methods (UEMs). Calculation of these makes sense if a study aims to compare UEMs, but not, it is argued here, if a study aims to evaluate a given application. Illustrated by a case study employing different UEMs, it is argued here that for practitio...

Are there valid proxy measures of clinical behaviour? a systematic review

BACKGROUND Accurate measures of health professionals' clinical practice are critically important to guide health policy decisions, as well as for professional self-evaluation and for research-based investigation of clinical practice and process of care. It is often not feasible or ethical to measure behaviour through direct observation, and rigorous behavioural measures are difficult and costly...

Journal

Journal title: Information Sciences

Year: 2021

ISSN: 0020-0255, 1872-6291

DOI: https://doi.org/10.1016/j.ins.2021.10.004